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ABSTRACT 

We carried out the largest (> 3.5 x 10^ Mpc^, 26 deg^) Ha narrow band survey to date at z ^ 
0.2 in the SA22, W2 and XMMLSS extragalactic fields. Our survey covers a large enough vol¬ 
ume to overcome cosmic variance and to sample bright and rare Ha emitters up to an observed 
luminosity of ^ erg s“^ equivalent to ^ IIM© yr“^. Using our sample of 220 sources 
brighter than > 10"^^ erg s“^ (> IMq yr“^), we derive Ha luminosity functions, which are 
well described by a Schechter function with 0* = Mpc“^ and 

erg s“^ (with a fixed faint end slope a = —1.35). We find that surveys probing smaller vol¬ 
umes 3 X 10^ Mpc^) are heavily affected by cosmic variance, which can lead to errors of 
over 100 per cent in the characteristic density and luminosity of the Ha luminosity function. 
We derive a star formation rate density of Psfrd = 0.0094 zb 0.0008 Mq yr“\ in agreement 
with the redshift-dependent Ha parametrisation from Sobral et al. (2013). The two-point cor¬ 
relation function is described by a single power law a){Q) = (0.159 ± 0.012) 
corresponding to a clustering length of rp = 3.3 ±0.8 Mpc/h. We find that the most lumi¬ 
nous Ha emitters at z ^ 0.2 are more strongly clustered than the relatively fainter ones. The 
Ha emitters at z ^ 0.2 in our sample reside in ^ Mq dark matter haloes. 

This implies that the most star forming galaxies always reside in relatively massive haloes 
or group-like environments and that the typical host halo mass of star-forming galaxies is 
independent of redshift if scaled by LHa/f-Ha(^)’ proposed by Sobral et al. (2010). 

Key words: galaxies: luminosity function, mass function, galaxies: evolution, galaxies: for¬ 
mation, cosmology: large-scale structure of Universe 


1 INTRODUCTION 

The star formation (SF) activity in the Universe was significantly 
higher in the past, reaching a peak ^ 10—11 Gyrs ago (z ^ 2 — 3, 
e.g. Lilly et al. 1996; Karim et al. 2011; Bouwens et al. 2011; Gu- 
nawardhana et al. 2013; Sobral et al. 2013; Bouwens et al. 2015), 
and with the typical star formation rate (SFR) of galaxies (SFR*) 
at z ^ 2 being a factor ^10 times higher than at z = 0 (Sobral et 
al. 2014). However, the understanding of how and through which 
physical mechanisms the typical SFRs of galaxies have declined 
over the last 11 Gyrs is still poor. 

In order to study SF across cosmic time, a number of tracers 
can be used. Ultra violet (UV) data can be used to trace radiation 
coming from massive, short-lived stars. Dust heated by the UV ra¬ 
diation emits in the far infra-red (FIR). The radiation from the mas- 
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sive stars also ionises the surrounding gas and leads to numerous 
recombination lines such as Ha (6563A) and [Oil] {3121 k). Ra¬ 
dio observations can be used to trace emission from super nova 
remnants. However, it is not trivial to combine these SF indicators, 
given they trace different phases of SF (averaged on short, ^10 
Myr, or long, ^ 100 Myr, timescales, dust obscured, etc.), with dif¬ 
ferent selection functions. Some selections are significantly biased: 
UV-selected samples miss dusty/metal enriched star forming galax¬ 
ies, while the FIR exclusively selects dusty star-forming regions. 
Therefore, one of the main challenges in obtaining a complete pic¬ 
ture of the SF evolution is the direct comparison of equally selected 
large samples of SF galaxies at a range of redshifts. Samples at high 
redshift tend to be obtained with a completely different selection 
than those at lower redshift, which can result in misinterpreted evo¬ 
lutionary trends which are more likely connected with the different 
selections at different redshifts than the actual evolution of galaxies 
across time (e.g. Stott et al. 2013). 

An effective way of overcoming such limitations is by using a 
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Table 1. Area and volumes covered by the narrow band observations. Only 
the common area between the two filters is listed. The same area is used to 
calculate the co-moving volume. 


Field 

No pointings 

Area 

z 

Volume 



deg^ 


lO'* Mpc^ 




0.19 

7.5 

SA22 

24 

6.1 




0.22 

9.8 

W2 

12 

3.6 

0.19 

4.4 

0.22 

5.7 




0.19 

3.9 

XMMLSS 

13 

3.1 




0.22 

5.0 

Total 

49x2 

12.8x2 


36.3 


single technique and a single SF indicator up to the peak of the star 
formation activity. This can be achieved by tracing the Ha emis¬ 
sion line, which is one of the most sensitive and well-calibrated SF 
traces and also benefits from low intrinsic dust extinction within the 
host galaxy (when compared to e.g. UV). Ha surveys performed 
using the narrow-band (NB) technique can provide clean, large and 
complete samples SF galaxies (c.f. Oteo et al. 2015). 

A successful example of the NB technique put into practice 
is the High Redshift Emission Line Survey (HiZELS, Geach et al. 
2008; Best et al. 2010; Sobral et al. 2013), but also see the pioneer¬ 
ing works of Bunker et al. (1995), Moorwood et al. (2000), Kurk et 
al. (2004), Ly et al. (2007) and Shioya et al. (2008). At z - 1 - 2, 
the volumes probed by HiZELS over a number of different fields 
(^5 — 10 deg^) virtually overcome cosmic variance (Sobral et al. 
2015b). However, at z < 0.4, the volumes probed over 1—2 deg^ 
areas are only a minor fraction of those at high-redshift. Indeed, 
the samples at low redshift are greatly limited by cosmic variance, 
and even the widest surveys (e.g. Shioya et al. 2008, Cosmologi¬ 
cal Evolution Survey (COSMOS)) struggle to reach the character¬ 
istic Ha luminosity additional limitation is saturation, 

which means missing the luminous population of Ha emitters (with 
>1 — 3 Mq yr“^, for discussion of this effect see Stroe et al. 2014). 
This can lead to an underestimation of Ha luminosity function (LE) 
bright end and an exaggeration of the evolution of from high 
to low redshift. 

The combination of all these issues and the different selection 
techniques applied by each study makes it extremely hard to fairly 
compare between z < 0.4 and z > 1 samples when based on the 
same surveys. While it is possible to use other samples at lower 
redshift (e.g. spectroscopic selection, Gunawardhana et al. 2013), 
the importance of using the same selection in order to obtain clean 
and clear evolutionary trends cannot be stressed enough: without 
the guarantee of a unique selection, any evolutionary trends be¬ 
come hard/impossible to understand and interpret, limiting our un¬ 
derstanding. 

In order to overcome the current shortcomings we clearly re¬ 
quire a large Ha survey at lower redshifts which can be directly 
matched to higher redshift. In this paper we present a large survey 
at z ^ 0.2, covering a similar co-moving volume (3.5 x 10^ Mpc^, 
spread over 3 independent fields to overcome cosmic variance) and 
complete down to similar luminosity limits relative to as sur¬ 
veys at z > 1. The structure of the paper is as follows: in §2 we 
present the observations and the reduction of the narrow-band data, 
while in §3 we show the selection of the Ha emitters. §4 deals with 
the z ~ 0.2 Ha luminosity function and §5 the clustering of bright 


Table 2. Typical 3cr limiting magnitudes for the three fields (including the 
standard spread in values), for each filter. The depth for each pointing (and 
within each CCD of out of the four WFC CCDs) varies across the fields 
over the ranges reported in the third and last column. 


Field 

Filter 

3a 



mag 

SA22 

NBl 

NB2 

ip- > 

1 + 1 + 

o oo o 

W2 

NBl 

NB2 

16-711 

XMMLSS 

NBl 

NB2 

17 7 + 0-4 
^ + '- 0.3 

17 5+0.5 
^ +^-0.3 


Ha sources and the implications of our results for the cosmic SE 
evolution are presented. We present concluding remarks in §6. 

At the two redshifts probed, z 0.19 and 0.22, 1 arcsec cov¬ 
ers a physical scale of 3.2 kpc and 3.6 kpc, respectively. The lumi¬ 
nosity distance is Jp ~ 940 Mpc at z ^ 0.19 and ^1110 Mpc at 
z = 0.22. All coordinates are in the J2000 coordinate system. We 
use the Chabrier (2003) initial mass function (IME) throughout the 
paper, and results from other studies are also converted to this IME. 


2 OBSERVATIONS & DATA REDUCTION 

We obtain NB data tracing Ha at z ^ 0.19 and ~ 0.22 in three 
well studied extragalactic fields located at high Galactic latitude. 
W2 is part of the Canada-Erance-Hawaii Telescope Legacy Sur¬ 
vey (CEHTLS) 155 deg^, wide and shallow survey (Gwyn 2012), 
aimed at studying the large scale structure and matter distribution 
using weak lensing and galaxy distribution. S A22 is part of the W4 
field in CEHLS and multiwavelength data has been compiled by 
Matthee et al. (2014) and Sobral et al. (2015b). The XMM Large 
Scale Structure Survey (XMMLSS, Pierre et al. 2004) is aimed 
at mapping large scale structures through clusters and groups of 
galaxies. 

2.1 Narrow band Ha observations 

We obtained narrow band data using the NOVA782HA and 
NOVA804HA (Stroe et al. 2014, 2015; Sobral et al. 2015a) filters 
on the Wide Wide Eield Camera (WEC)^ mounted on the Isaac 
Newton Telescope (INT, I13BN008, PI Sobral) Eor brevity, we 
label the filters as NBl (NOVA782HA) and NB2 (NOVA804HA). 
Given the central wavelengths of the filters are 7852.4A and 
8036. 15A, with a full-width-half-maximum (FWHM) of IlOA, 
the two filters trace Ha emission in the z = 0.1865 — 0.2025 and 
z = 0.2170 — 0.2330 redshift ranges. Note that given the large field 
of view of WEC, a slight blue shift in the filter central wavelength 
is expected at large off-axis distances. However, given the WEC fo¬ 
cal ratio (//3.29), this effect is expected to be very low (a few per 
cent Bland-Hawthorn et al. 2001). Sobral et al. (2015a) and Stroe et 
al. (2015) characterised the filters with spectroscopy from the Keck 
and Wiliam Herschel Telescopes with sources located both towards 

^ http://WWW.ing.iac.es/engineering/detectors/ultra_wfc. 
htm 

^ http://www.ing.iac.es/Astronomy/telescopes/int/ 
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Figure 1. Colour-colour plots for the SA22, W2 and XMMLSS fields, mainly used to remove stars. The first plot shows g — r versus J — Ks while the middle 
and last plots show g — r versus r — i. We first separate stars and emitters using the g — r versus / — Ks, and the apply an extra cut using the optical colours 
to further remove stars with absorption features in one of the filter. The solid red and black lines display the colour cuts used to select point-like objects. Ha 
emitters are plotted in red crosses, while point-like sources are plotted as stars. 4000A break galaxies are plotted in green crosses and high redshift sources in 
purple crosses. 


and away from the pointing centre and found that the redshift distri¬ 
bution of Ha emitters matches that expected from the filter profile, 
without any noticeable offset. 

Observations were conducted in five bright nights, between 22 
and 26 of October 2013, under ^ 1 arcsec seeing conditions. A five- 
position dither pattern was employed for the individual exposures 
(of 600 s each) to cover the spacings between the four WFC CCDs. 
Forty-nine individual pointings (of ~ 0.3 deg^ each with WFC) 
split between the three fields (SA22, W2 and XMMLSS) cover an 
area of almost 13 deg^ at each of the two redshifts (thus an effective 
area of ^ 26 deg^ combined), tracing a total co-moving volume of 
about 3.63 X 10^ Mpc^. The overlap with the multiwavelength data 
extends to about 10 deg^ per redshift. 

2.2 Narrow band data reduction 

We reduce the data using the PYTHON based pipeline described in 
Stroe et al. (2014). In short, we median combine the sky fiats and 
biases and use the stacks to correct the science data. After detecting 
sources using the SEXTRACTOR package (Bertin & Arnouts 1996) 
and masking them in each science exposure, we median combine 
the exposures to obtain a ‘super-fiat’. We divide the data through 
the ‘super-fiat’ to correct for ‘fringing’. We then use SCAMP 
(Bertin 2006) to find astrometric solutions for the science expo¬ 
sures. The exposures are normalised to the same zero-point (ZP) by 
comparison to the red magnitude in the fourth United States Naval 
Observatory (USNO) Catalog (UCAC4; Zacharias et al. 2013). We 
combine the processed data into final stacked images using S Warp 
(Bertin et al. 2002). We photometrically calibrate our data against 
the i band magnitude from the Sloan Digital Sky Survey (SDSS) 
Data Release 9 (SDSS DR9 Ahn et al. 2012), which covers all our 
fields (SA22, W2 and XMMLSS). We extract magnitudes within 
5 arcsec apertures using SEXTRACTOR (Bertin & Arnouts 1996). 
This corresponds to a physical diameter of 18 kpc at ~ 0.2 red¬ 
shift. 


We calculate 3(7 limiting magnitudes using the RMS noise re¬ 
ported by SEXTRACTOR (see Table 2). The depth of the obser¬ 
vations varies across the pointings and even between the different 
chips of the WFC. Hence, we calculate the RMS noise individually 
for each CCDs, for each pointing, across the three fields. 

We apply the NB technique to select line emitters, using a 
NB filter tracing line emission within a narrow range in redshift, in 
combination with another NB or broad band (BB) filter used for the 
estimation of the continuum emission underlying the emission line. 
We use two NB filters to trace Ha emission in two redshift ranges 
(0.1865-0.2025 and 0.2170-0.2330). For each NB filter, we use 
the other NB filter to estimate the continuum BB emission. In this 
way, for line emitters, one NB filter captures the BB emission as 
well as the line emission, while the other NB filter only captures 
the stellar continuum emission. Our method is similar to that of 
Dale et al. (2010), who use twin NB filters for continuum subtrac¬ 
tion. In further text, we use labels according to the filter which was 
used as NB filter in that particular case. Therefore, when we label 
with NBl, we refer to line-emitters in the 0.1865 — 0.2025 redshift 
range, while NB2 refers to the 0.2170 — 0.2330 range. The details 
of the selection method are laid down in §3.2. 

2.3 Optical and IR data 

In our analysis, we use the rich multi-wavelength optical and infra¬ 
red (IR) data available for the SA22, W2 and XMMLSS fields. 

All three fields are part of the CFHTLS wide and shallow sur¬ 
veys (SA22, W2 and XMMLSS are in fields W4, W2 and Wl). 
We make use of the g, r, i and z photometry (Erben et al. 2013) 
and photometric redshifts (Ilbert et al. 2006) available through the 
CEHTLS T0007 release. 

We also employ near IR data in the J and Ks filters, down 
to magnitude ^ 21.2 and ^ 20.0 respectively, obtained as part of 
the Visible and Infrared Survey Telescope for Astronomy (VISTA) 
Hemisphere Survey (VHS, McMahon and the VHS Collaboration, 
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Figure 2. Colour-magnitude diagrams showing the excess as function of NB magnitude. The selection is performed separately for each CCD/pointing, field 
and NB filter, using the other NB filter for continuum estimation. Each panel is labelled with the corresponding field and the filter which is used as NB. The 
curves show average 3E colour significances for the average depth, as the RMS value varies between the pointings and CCDs. The horizontal dashed, black 
lines represent the intrinsic EW cuts. Note that we correct for incompleteness arising from our slightly different EW and colour significance cuts. 


MNRAS 000, 000-000 (0000) 


























A large narrow band Ha survey at 0.2 5 


Table 3. Number of line emitters and Ha emitters selected in each field and 
filter. We also list the average limiting observed Ha luminosity at 50 per 
cent completeness and the equivalent SFR (using equation 11). 


Field 

Filter 

Emitters 

Ha emitters 

log(THa) 

SFR 





(erg s“^) 

(Mq yr-l) 

SA22 

NBl 

153 

59 

41.4 

1.1 

NB2 

238 

91 

41.4 

1.1 

W2 

NBl 

33 

13 

41.4 

1.1 

NB2 

55 

15 

41.6 

1.7 

XMMLSS 

NBl 

51 

23 

41.1 

0.5 

NB2 

50 

19 

41.4 

1.1 

Total 

both 

576 

220 




2012, in preparation). Where available, in the XMMLSS field, we 
preferentially use data from the VISTA Deep Extragalactic Obser¬ 
vations (VIDEO) Survey (Jarvis et al. 2013), which is about 3.5 
magnitudes deeper than VISTA. We also make use of the IR pho¬ 
tometric data taken in the SA22 field as part of the second data 
release of the UKIDSS Deep Extragalactic Survey (Warren et al. 
2007), which reaches magnitudes 23.4 and 22.8 in the J and Ks 
bands, respectively, with a catalogue from Sobral et al. (2015b). 

We make use of the photometric and spectroscopic redshift 
compilation in the UKIDSS Ultra Deep Survey (part of XMMLSS) 
available as part of their 8th data release^, as well as other publicly 
available spectroscopy in the XMMLSS field (Garcet et al. 2007; 
Polletta et al. 2007; Tajer et al. 2007; Melnyk et al. 2013). 


We exclude stars using a colour-colour selection criterion us¬ 
ing optical and IR colours based on Sobral et al. (2012), keeping in 
mind the distribution of sources in the colour-colour diagram. This 
is illustrated in Eigure 1 . 

Red stars are selected using: 

{g-r)>2{J-Ks) + l Sl (g-r)>0.8 & >-0.7 

( 1 ) 

We select dwarf stars via: 

{g-r)> (7/3(r - i) - 2/3) & (g - r) > 1.0 (2) 

Optically blue stars and dwarf stars with absorption features 
are selected by: 

(g - r) > 2(7 -/sTi) + 1 & (g-r)< 0.8 (3) 

We additionally use the ‘StarGaT parameter in the CEHTLS 
photometric redshift catalogue to select stars (Ilbert et al. 2006), 
which categorises sources as point-like or extended objects. 

Thus, in summary, we label sources as stars if: 

• Source passes the red star selection criterion (equation 1) or 

• Source passes the blue star selection criterion (equation 3) or 

• Source passes the dwarf star selection criterion (equation 2) or 

• Source is classified as star by the CEHTLS ‘StarGaT parame¬ 
ter. 

About 60 — 80 per cent of the sources mimicking emission 
lines are marked as stars. Spectroscopic observations using NBl 
and NB2 (e.g Stroe et al. 2015; Sobral et al. 2015a) confirm the 
presence of such stars. All the sources masked as stars are removed 
from catalogues such that they are not selected as line emitters. 


3 METHODS AND SELECTING THE Hoc SAMPLES 

Once sources are detected in the NB images, we cross-match the 
NB catalogues with the optical and IR catalogues presented in Sec¬ 
tion 2.3, using a 1 arcsec positional tolerance. Note that because 
the BB catalogues are deeper than our data by at least 2 mag, we 
have 100 per cent optical and IR coverage in the areas we have 
EOV overlap with all the multiwavelength data. We use each NB 
catalogue as base catalogue for the cross-match. 

3.1 Star removal 

As explained in §2.2, we use the two NB filters to trace Hoc emis¬ 
sion at two redshifts ranges (0.1865 — 0.2025 and 0.2170 — 0.2330). 
However, given the wavelength coverage of the two adjacent filters 
our samples of line emitters is contaminated by stars (see also Stroe 
et al. 2014). Stars could mimic having an emission line if they have 
extremely red or a broad absorption feature, which would lead to a 
strong colour between the two NB filters. We expect the line emit¬ 
ters selected in the NB2 filter to be particularly contaminated with 
a population of (L, M) dwarf stars (Kirkpatrick et al. 1991, 1999). 
They will be selected as having excess in NB2 because their con¬ 
tinuum has a broad absorption feature falling within the NB 1 filter, 
leading to an underestimation of the continuum emission. The ex¬ 
tremely red BB colours of these sources are also consistent with 
them being red dwarfs. 

^ http://WWW.nottingham.ac.uk/astronomy/UDS/data/data. 
html 


3.2 Selection of line emitters 

We use the formalism developed by Bunker et al. (1995), which is 
widely used in the literature (e.g. Shioya et al. 2008; Sobral et al. 
2009; Stroe et al. 2014) to select large numbers of line emitters. 
We refer the interested readers to those papers for the details of the 
selection criteria. 

We select line emitters separately in each field and each NB 
filter. Eor brevity, in the following equations, we label the filter 
used to select emitters as NB, while we name the other NB filter, 
used to quantify the continuum emission, as BB filter. Note that we 
attempted the selection of line emitters using the i band filter, fol¬ 
lowing Stroe et al. (2014) and Stroe et al. (2015). However, the rel¬ 
atively deep CEHTLS data becomes saturated at magnitude 17 — 18 
and would prevent the selection of bright line emitters. Therefore 
using each NB filter for continuum subtraction of the other repre¬ 
sents the optimal strategy, enabling the selection of line emitters up 
to magnitude 10. Using much deeper broad band i data would al¬ 
low us to probe down to fainter emitters, but our aim for the paper 
is to study the bright population. By comparison, the widest Ha 
survey at z 0.2 to date, performed by Shioya et al. (2008), can 
only probe sources as bright as ^ 18 mag, but excels at the faint 
end (going down to 24 mag). 

We select emitters in each NB filter based on their excess 
emission compared to the BB emission (quantified using the other 
NB filter). We first correct for any systematic colour offset between 
the two NB filters. Colour is defined here as the difference in mag¬ 
nitude between the filter used as NB and the filter used to measure 
broad band. We estimate a median offset of this colour, based on 
the scatter in the colours at non-saturated, but still bright NB mag¬ 
nitudes. We then apply this correction to the colour and the NB 
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Figure 3. Photometric redshift distribution of line emitters for each field. 
Note the quality of the photometric redshifts varies between the fields. The 
top panel shows the main line we expect to capture with out two narrow 
band filters. The distribution contains clear peaks around z ~ 0.2, indicating 
our sample is dominated by Ha emitters, with little contamination from 
higher redshift emitters. 

magnitude. However, because the filters are close in wavelength 
this correction is small (0.02 and 0.03 mag, for NBl and NB2 re¬ 
spectively). 

The excess emission is then quantified through the colour ex¬ 
cess significance E, which is used to separate sources with real 
colour excess, compared to excess caused by random scatter (So- 
bral et al. 2009; Sobral et al. 2012): 

iQ-0.4(mBB-mArB) 

; (4) 

l(^-QA{ZPAB-mNB) ^ (^(j2^ -P (j2^ j 

where ZP^b is the magnitude system zero-point, tunb and tubb are 
the NB and BB magnitudes (where NB is the filter used for detec¬ 
tion of line emitters and BB is the other NB filter used for quan¬ 
tifying the continuum emission), r is the radius of the aperture in 
pixels and Onb and Obb are the rms noise levels. 

The NB or BB fiux /nb^bb are calculated as: 

, (5) 

^NB,BB 

where c is the speed of light, Anb and Abb are the central wave¬ 
lengths of the two NB filters and ZP^b = 48.574 is the ZP of the 
AB magnitude system. The line fiux is: 

^line = {InB - /bB ) • (6) 

Note that the two filters are independent, hence there is no over¬ 
lap in wavelength between NBl and NB2. Therefore, if one filter 
captures line emission on top of the continuum, automatically the 
other NB filter picks up only continuum emission. Therefore, the 
line fiux formula accounts for the fact the filter used as BB does 
not contain any line emission. 

We use the E parameter in conjunction with an equivalent 
width (EW) cut, which ensures that we select only sources which 


■ Emitters with spectroscopy ■ Selected as Her 



Zphot 


Figure 4. Photometric versus spectroscopic redshift for sources selected 
as emitters. The shaded red area indicates sources which based on their 
photometric redshift were selected as Ha. The yellow shaded area indicates 
the redshift range captured by the filters. 

have a ratio of the line to continuum fiux larger than the scatter at 
bright magnitudes. The observed EW is defined as: 

EW = AAnb -^^V^^^ ’ 

JBB 

where AAnb = 100 A is the FWHM of the NB filters, while fyB 
and /bb are the NB and continuum fiuxes. Note this formula is a 
simplified version of those presented in, e.g.. Bunker et al. (1995) 
and Sobral et al. (2009), because we do not expect our BB filter to 
contain any emission line flux. 

In the restframe of the sources, the intrinsic EWq is: 

EWo = EW/(l+z). (8) 

In conclusion, we select sources as emitters if: 

• Their colour significance E is higher than 3 and 

• Their equivalent width is higher than 3(7, where a is the colour 
excess (BB-NB) scatter at bright, but not saturated magnitudes. 

The E = 3 colour significance and the 3(7 excess depend on the 
depth of the observations in each field (See Figure 2). We choose 
to not impose a single, common cut, to follow the natural depth of 
the data, rather than cutting the sample at excessively high EW and 
E. However, we note that we fully correct for the sources missed by 
our cuts, as explained in §4.1. 

3.3 Selection of Ha candidates 

The line emitter population is made of Ha emitters at z ^ 0.2, 
as well as higher redshift line emitters: H/3 (Arest = 4861 A), 
[Olll]AA4959,5007 emitters at z ^ 0.61 —0.65 and [Oil] (Arest = 
3727 A) emitters at z ^ 1.15 (see Figure 3). Our sample could be 
contaminated by a population of 4000 A break galaxies at z ^ 0.8. 
As shown in Stroe et al. (2014), at 8000 A and lower line fluxes, 
the line emitter population is dominated by [OH]A3727 emitters 
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and z ^ 0.8 4000 A break galaxies. However, at high fluxes, the 
number of Ha and H/3/[OIII] steeply rises, each amounting to 
about 50 per cent of the line emitter population. Therefore, given 
the shallow depth of our survey, we are strongly biased against de¬ 
tecting high-redshift (z > 0.6) sources. We expect the Ha emitters 
to amount to about half of the emitter population. Figure 3, pre¬ 
senting the photometric redshift distribution of the line emitters, 
confirms these findings. The steps we undertake to robustly sepa¬ 
rate the Ha emitters from the other sources are described in the 
following paragraphs. 

We first visually inspected all line emitter candidates to flag 
any spurious sources coming from noisy edge regions of the chips 
or from false detections within the haloes of bright sources. 

Ha emitters are selected in the following way: 

• The photometric or spectroscopic redshift of the source does 
not lie in the expected ranges for H/3/[Olll]/[Oll] emitters (0.37 < 
z < 0.7 and 0.9 < z < 1.2) and 4000 A break galaxies (0.7 < z < 
0.9) and 

• The photometric or spectroscopic redshift of the source lies in 
the 0.15 < z < 0.25 range. 

Figure 1 displays the colour-colour distribution of line emitters, the 
cut employed to separate the source types and highlights the loca¬ 
tion of the Ha emitters. All three fields and both Alters are shown 
in the same plot. Separating the data per held and Alter results in 
colour-colour diagrams which are consistent with Figure 1, indi¬ 
cating there are no systematic differences between the populations 
selected with the two NB Alters. The number of Ha emitters se¬ 
lected in each held can be found in Table 3, amounting to a total of 
220 Ha emitters. This amounts to almost 40 per cent of the total 
number of emitters, as expected and explained in §3.3. 


3.3.1 Purity of the Ha sample 

We compare the spectroscopic and photometric redshifts in order to 
study the purity of the Ha sample (Figure 4). We And that the pho¬ 
tometric redshifts are within 0.05 of the spectroscopic ones. From 
the sources spectroscopically confirmed to be at lower or higher 
redshift, none make it into the Ha catalogue, implying a very low 
contamination. Note that the range we used for selecting sources 
as Ha from photometric redshifts is 0.15 — 0.25, which is large 
enough to capture Ha emitters in both Alters, while minimising 
contamination. Out of 12 spectroscopically conflrmed emitters we 
miss two sources, implying completeness higher than 80 per cent. 
However, the spectroscopy is limited and the low number statistics 
could lead to an overestimation or underestimation of the complete¬ 
ness and contamination. Future spectroscopic observations will al¬ 
low us to further investigate this. 


4 Ha LUMINOSITY FUNCTION AND 
STAR-FORMATION RATE DENSITY 

We use the sample of 220 Ha sources to build luminosity functions. 

Our Alters are sensitive not only to Ha, but also to the adja¬ 
cent [Nil] double (6450 and 6585 A) forbidden line. We subtract 
the [Nil] contribution from the line fluxes using the method from 
Sobral et al. (2012) to obtain Ha fluxes (Fh^), which has been 
spectroscopically conflrmed by Sobral et al. (2015b). The average 
[Nil] contribution is about 30 per cent of the total line flux. 

After we obtain pure Ha fluxes Fh^, we calculate the Ha 


luminosity Lh^ : 

= 47rj|(z)FHa, (9) 

where di^{z) is the luminosity distance (940 Mpc for the NBl Alter 
and 1110 Mpc for NB2). 

4.1 Completeness, volume and filter profile corrections 

We use the method of Sobral et al. (2012) to correct for the incom¬ 
pleteness arising from missing sources with faint Ha fluxes and/or 
low EW. We select random samples of sources passing the selec¬ 
tion criteria for being located at the redshifts traced by the two Al¬ 
ters, but which are not selected as Ha emitters. Fake Ha emission 
lines are added to these sources which are then passed through the 
Ha selection criteria {EW and Z) described at the end of §3.3. 

Because of the different depth between the pointings and be¬ 
tween the four CCD chips, we independently study the recovery 
rate as function of the Ha flux for each chip, pointing. Alter and 
fleld. The results of the completeness study can be found in the 
Appendix in Figure Al. Our results are corrected for the effects of 
incompleteness, especially the Ha luminosity function (see §4.3, 
§4.4 and, e.g.. Figures 5, 6 and 7). 

The volumes probed in each fleld and at each redshift assum¬ 
ing that the Alters have a perfect top-hat shape are listed in Table 1 . 
The total co-moving volume probed is 3.63 x 10^ Mpc^, by far the 
largest volume ever surveyed in Ha at z ~ 0.2. However, since the 
Alter transmission does not follow perfectly an idealised top hat, we 
follow the method of Sobral et al. (2009) and Sobral et al. (2012) 
and correct the volumes to account for sources missed at the edges 
of the Alter. 


4.2 Survey limits 

A 50 per cent completeness (see Figure Al) translates to average 
limiting Ha luminosities of erg s~^ for our survey. This 

is equivalent to limiting star formation rates (SFR) of 0.5 — 1.8 Mq 
yr“^, with no intrinsic dust extinction applied. If 1 magnitude of 
dust extinction is applied this is equivalent to 0.2 — 0.8 SFR* (see 
equation 11 in §4.4). 

The maximum observed Ha luminosity our survey probes is 
~ lO'^^-^ erg s“^ equivalent to SFRs of 11 Mq yr“^ (or > 27 Mq 
yr“^ if 1 mag of dust extinction is applied). By comparison, the 
widest Ha survey at a similar redshift, performed by Shioya et al. 
(2008), reaches ^ 10^^-^ erg s“^ or 3.5 Mq yr“^ (8.7 Mq yr“^ 
with dust extinction). This means our survey probes galaxies more 
than three times more star forming than previous surveys. 

4.3 Ha luminosity function 

Using our flnal sample of Ha emitters, we build luminosity func¬ 
tions (LF) which characterise the density of sources at any given 
Ha luminosity. To do so, we bin sources based on their luminosity 
(corrected for the [Nil] contribution, §4, but not for intrinsic dust 
extinction), by adding their associated inverse co-moving volume, 
corrected for the real Alter profile and incompleteness (as shown in 
§4.1). 

We fit the binned data with a Schechter (1976) parametrisa- 

tion: 

0(FHa)dLHa = 0* e OO) 
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Figure 5. The Ha luminosity function at z ~ 0.2 from our study and the best fit Schechter function. The L^a is not corrected for intrinsic dust attenuation. For 
comparison, the results from Shioya et al. (2008) are also shown. Note the excellent agreement between the data in the overlapping luminosity range. However, 
our data probes brighter luminosities, enabling the first determination of the bright end of the Ha luminosity function at z ~ 0.2. 


Table 4. Best fit luminosity function at z ~ 0.2 obtained from combining data in the three fields (SA22, W2 and XMMLSS) and two NB filters. Since our data 
is not very deep, but probes the bright-end really well, we fix the faint-end slope a at two values. For comparison, we also list the results and volumes probed 
from other studies at a similar redshift. Note that none of the are corrected for Ha extinction. 


Source 

z 

V (10'* Mpc^) 

a 

log(|)* (Mpc-5) 

log(ergs *) 


~ 0.2 

36.3 

-1.35 

-2.85 ±0.03 

41.71 ±0.02 

This study 

-1.70 

-3.06 ±0.04 

41.83±0.03 

Shioya et al. (2008) 

-0.24 

3.1 

— 1 

^•■^•^-0.13 


41 54+0-38 

^1.0^_0 29 

Ly et al. (2007) 

-0.24 

0.5 

-1.70±0.10 

-2.98 ±0.40 

41.25 + 0.34 

Drake et al. (2013) 

-0.25 

1.2 

— 1 03”*"^'^^ 

9 ro-l-0.17 
Z.0.3_021 

40.83l«:l« 


where is the characteristic Ha luminosity, 0* is the character¬ 
istic density of Ha emitters and a is the faint-end slope of the LF. 
Since our data is not deep enough to properly constrain the faint 
end slope of the LF (see Table 3), we fix a to two values previ¬ 
ously derived in the literature using deep data: —1.35 from Shioya 
et al. (2008) and —1.7 from Ly et al. (2007). In fitting the LFs, we 
assume Poissonian errors. 

Our best fit Ha LF is described by a typical luminosity 
log(L^^) = (41.71 ±0.02) gj.g g-i ^ characteristic density 


log(0*) = Mpc“^ (see Table 4 and Figure 5). Our 

data samples really well the bright-end of the LF, which enables 
us to place tight constraints on 0* and (errors lower than 15 
per cent). However, we lack depth (lowest bin at ~ 10"^^erg s“^), 
so we fix the faint-end slope to —1.35, as obtained by Shioya et 
al. (2008) from the previously widest Ha survey, which benefits 
from high-quality, deep data reaching luminosities of 10^^-^ erg 
s“^, but is limited at the bright end. Therefore, the two surveys are 
highly complementary. Within the overlapping regions with data 
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Figure 6. A range of luminosity functions at z ~ 0.2, from the current work 
and the works of Shioya et al. (2008), Ly et al. (2007) and Drake et al. 
(2013). In shaded areas, we overplot the ranges allowed by the 1 cr error bars 
of the LF parameters. The works of Shioya et al. (2008), Ly et al. (2007) and 
Drake et al. (2013) explore the faint end part of the luminosity. The shaded 
areas indicate the Icr uncertainties of the Schechter function parameters. 
Our measurements are consistent with previous work, but significantly im¬ 
prove the previously unexplored bright end. While our measurement error 
is given by cosmic variance, as shown in §4.6. However the other measure¬ 
ment do not include the error given by cosmic variance, which would add 
an error of about 100 — 200 per cent in the parameters. 


from both the Shioya et al. (2008) and our survey, the measure¬ 
ments are in excellent agreement. However, our LF, constrained up 
to Liicc = 10 ^^-^ erg s“^, indicates a slightly larger value of 
but still consistent with Shioya et al. (2008) within their large error 
bars (see Figure 6 ). Note that their uncertainties do not include the 
error from cosmic variance, which can results in 100 — 200 errors in 
the parameters of the LF (see §4.6). Any discrepancy between the 
results can be explained by cosmic variance, given Shioya’s vol¬ 
ume is ^ 10 times smaller than ours and probes a single field. The 
differences between the 0 * results could also be explained by the 
different colour-colour methods used to separate the Ha emitters 
from higher redshift line emitters. 

The discrepancy with other studies is much larger however 
(see Figure 6 ). Compared to our results, Ly et al. (2007), slightly 
overestimate 0* (not significant) and underestimate (at the 2(7 
level). Drake et al. (2013) obtain an which is highly underes¬ 
timated (lO'^^-^^ erg s“^). The difference to our value is significant 
at the 11(7 level. This is entirely driven by Drake’s small volume 
('^ 30 times smaller than ours) and the long exposures they were 
using in their study which prevented the study of sources brighter 
than 20 mag in the NB filter. Given the large variations in the LF 
parameters from cosmic variance, we expect all theses results to be 
consistent with our measurement, once the cosmic variance error is 
folded in (see §4.6). 


4.4 Star formation rate density 

We can calculate the star formation rate density (SFRD) at z ^ 0.2 
by integrating the luminosity function and converting Ha luminos¬ 
ity to SFR. We use the Lh^ to SFR conversion from Kennicutt 



Figure 7. Evolution of the SFRD from z~2.23toz~0.2. Our measure¬ 
ment at z ~ 0.2 confirms the previously discovered decline in SFRD, which 
can be simply parametrised as a function of redshift (Sobral et al. 2013). 

(1998), corrected for the Chabrier (2003) IMF: 

SFi?(M 0 yr-l) =4.4x 10-42LHa(ergs-l). (11) 

The luminosity density is obtained by integrating the Ha LF: 

pLna=J^ ^{Lua)Lua^Lua ( 12 ) 

= r(a + 2)(^*L^«, (13) 

where r(w) = (w — 1)! is the Gamma function. By converting from 
luminosity to SFR through equation 11, the SFRD pspR is: 

PSFR = r(a + 2)0*L^„loO-4^H“(l -/agn) (14) 

where Ana is the intrinsic Ha dust extinction which we assume to 
be 1 mag and /agn = 0.15 is the fraction of the Ha luminosity 
expected to be due to contributions from broad line and narrow line 
AGN emission (e.g. Garn & Best 2010; Sobral et al. 2015a). 

Our measurement of the SFRD, psFRD = 0.0094 ± 0.0008 
M 0 yr“^Mpc“^, which matches with the value of Shioya et al. 
(2008) (0.010 ±0.006 Moyr-^ Mpc”^). Sobral et al. (2013) de¬ 
rive a redshift-dependent parametrisation of the SFRD (pspRD = 
—2.1/(1 ±z) ± log 10(4.4/7.9), corrected for the Chabrier IMF) 
based on their measurements and results from Ly et al. (2007) at 
z ^ 0.08 and Shioya et al. (2008) at z ^ 0.24 (see Figure 7). Our 
measurement perfectly agrees with the parametrisation, which pre¬ 
dicts a value of 0.01 at z ^ 0 . 2 . 

4.5 Distribution of Ha emitters 

Figure 8 shows the distribution of the Ha emitters in the three fields 
at the two redshifts, as selected in §3.3. Note the high degree of 
cosmic variance within and between the field and at the adjacent 
redshifts. 

On average, down to a limiting Ha luminosity of 10^^'^ 
erg s~^ or SFR ^ 1 Mq yr“^, we find ^ 2 Ha emitters per square 
degree (or ^ 3 per Mpc^). However, there are large areas with no 
emitters, while parts of the W2 and XMMLSS fields have densi¬ 
ties of up to 20 sources per square degree. The ‘Sausage’ massive, 
young post-merger galaxy cluster Stroe et al. (2014, 2015), where 
Ha emitters were selected with the NBl filter, was found to be 
extremely dense in star-forming galaxies and AGN, compared to 
blank fields. Down to the faintest Ha luminosities as our current 
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Figure 8. Smoothed sky distribution of the Ha emitters. Note the amount of cosmic variance within the fields. On average 2 emitters are found per deg^, but 
the values vary between 0 and 5 sources per deg^. 
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Figure 9. The values of LF Schechter parameters (j)* and when we fix 
a = —1.35. For fitting the LF, we create 1000 random sub-samples of Ha 
emitters, at a range of probed volumes. The data points are colour coded 
with the co-moving volume probed in units of 1000 Mpc^. Note how at 
small volumes the scatter is the value is extremely large (up to 4 — 5 dex), 
while at large volumes the values for (j)* and converge. We obtain sim¬ 
ilar results with a different value of a or when we use the data for the two 
filters separately (see Figure Bl) 



Figure 10. Distribution of the number of Ha emitters randomly selected 
within a range of volumes. As expected the larger the volume, the larger 
the number of sources, with a spread at each volume size caused by cos¬ 
mic variance. The Poissonian error relative to the mean number of sources 
does not dominate over the spread caused by cosmic variance, except where 
cosmic variance is minimised through the sampling of a large volumes. 
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Figure 11. The error distribution of the characteristic Ha luminosity and number density (j)*, as function of the volume probed. The error is calculated as 
htted value minus the mean of the distribution at each volume. The results are obtained when combining data from both NB filters, with faint end slope fixed 
to —1.35 (see Figures B2, B3 and B4 for results for other a and for the two filters independently). At each volume, 1000 realisations are performed, based 
on random samples of sources. Each figure shows the values obtained from the LF fitting in gray-black stripe. Darker colours mean more of the realisations 
found that particular or (j)* value. The violin plot next to each stripe encodes the histogram. The top panel shows the standard deviation cr of the 

^Ha values at each volume size. Note that spread of values drops the larger the volume probed, indicating a convergence in the values of and (^*. 


Table 5. Bin width AlogLna, starting bin logLHa,min and number of bins 
(N) chosen for studying the luminosity function, depending on the volume 
V probed. 


V range 

AlogLna 

logLHoc^min 

Abins 

< 2 X lO'* Mpc^ 

0.3 

41.5 

4 

2 X 10'* - 9 X 10^ Mpc^ 

0.2 

41.5 

4 

9 X 10^ - 18 X 10^ Mpc^ 

0.15 

41.4 

5 

18 X 10^-27 X 10“* Mpc^ 

0.15 

41.4 

6 

> 27 X 10^ Mpc^ 

0.1 

41.4 

8 


data surveys (10^^-^ erg s“^), the density is ^ 140 emitters per 
square degree, about 70 times above the average we find over an 
area of 20 deg^. Assuming Poissonian noise, the ‘Sausage’ cluster 
overdensity is significant at the > 11(7 level. 

The older ‘Toothbrush’ galaxy cluster merger, where the two 
subclusters collided about 2 Gyr ago, behaves differently. The den¬ 
sity is about ^16 emitters per square degree, densities similar to 
the densest parts of our wide, shallow Ha survey. Our results thus 
corroborate the conclusions from Stroe et al. (2014) and Stroe et al. 
(2015). 

4.6 Quantifying cosmic variance 

One of our goals is to understand the impact of cosmic variance 
and low number statistics on the determination of the LF parame¬ 
ters, especially motivated by the differences in LF found with the 


previous studies of Ly et al. (2007) and Drake et al. (2013). We 
generate random subsamples of Ha emitters, probing a range of 
volumes. We perform 1000 realisations starting from the smallest 
volumes for which we can fit a LF, up to the entire volume of our 
survey. We perform this experiment using Ha emitters in each NB 
filter and also combine all the data together, following Sobral et al. 
(2015b). 

The number of sources for each realisation is plotted in Fig¬ 
ure 10. As expected the average number of sources increases with 
the volume surveyed. We calculate the standard deviation of the 
spread in number of sources at each volume and compare that to 
the Poissonian error. In the calculation of the Poissonian error we 
take into account the fact that the sources are divided into bins. 
At very low volumes, the relative Poissonian error dominates over 
the spread in the number of sources, which is caused by cosmic 
variance. Given the depth of our survey, at the very small volumes 
(< 2 X 10^ Mpc^) the Poissonian error essentially goes to infinity. 
Overall, the total relative error, calculated as the sum in quadra¬ 
ture of the Poissonian and cosmic variance error, goes down with 
increasing volume. 

Naturally, when surveying a smaller volume, the number of 
Ha sources is proportionally smaller. We therefore adapt the num¬ 
ber of bins {N), the bin width AlogLna and the starting bin logLna, 
depending on the volume V probed, as detailed in Table 5. 

The results from the different realisations of the LF calculated 
from Ha emitters extracted over a range of volumes can be found 
in Figure 9. At small volumes (< 4 x 10^ Mpc^), the random reali- 
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□ Data (this work) - - - Fit, log(LHa)>41.55 (this work) 

- Fit, all data (this work) log(LHa)>40.54, Shioya et al. (2008) 



Figure 12. Angular two-point correlation function for bright Ha emitters 
(Lna > 10^^-^ erg s“^) at z ~ 0.2. The best fit power law relation is: ft)(0) = 
(0.109 ± 0.005) . For comparison, we plot the results for fainter 

emitters (Lna < 10^^-^ erg s“^) from Shioya et al. (2008). We find that more 
luminous Ha emitters are more clustered. 


Table 6. Two-point correlation function for Ha emitters at z ~ 0.2. Best fit 
as a single power law of the form ft)(0) = Note taht the filters and 
redshift distribution is different for Shioya et al. (2008) than for our study, 
so the amplitudes cannot be directly compared. 


★ z~0.2 (this work) - DM haloes, Mmin=10”Mo 

O z~0.24(Sobral etal. 2010) DM haloes, 

□ z~0.84(Sobral etal. 2010) dm haloes, 

A z~2.23 (Sobral et al. 2010) 



Figure 13. The dependence of the clustering length ro on redshift, using a 
consistent set of Ha emitters selected through NB surveys. For comparison, 
we are also showing data from Sobral et al. (2010). The plot suggests that 
typical (Ty^) emitters have very similar ro across cosmic time. At z ~ 0.2, 
there is a sharp increase in the typical DM halo mass with luminosity of the 
Ha sample. Note however, as shown in Figure 14, that once corrected for 
the redshift evolution of the characteristic luminosity, L^a sets the position 
of galaxies in relation to DM halo host. 


Source 

log(LHa) (erg s 

A 


This study 

41.00-42.40 

0.159±0.012 

-0.75 ±0.05 

Faint 

41.00-41.55 

0.208 ±0.035 

-0.61 ±0.07 

Bright 

41.55-42.40 

0.295 ±0.026 

-0.87 ±0.06 

Shioya et al. (2008) 

40.54-41.50 

0.019 ±0.004 

-1.08 ±0.05 

Shioya et al. (2008) 

39.40-40.54 

0.011 ±0.002 

-0.85 ±0.05 


sations of the LF give wildly different results, with values spanning 
4 — 5 dex. This is driven by two main factors: low number statis¬ 
tics and cosmic variance. The low number of Ha emitters in small 
volumes imposes wide and few Lh^ bins to gain enough number 
statistics. With few bins, the LF function is barely constrained. Ad¬ 
ditionally, small volumes do not fully sample the LF at the brightest 
Lhck, where Ha emitters are rare. Therefore, when the volumes are 
small cosmic variance is significant. However, with the increase of 
the probed volume, we can much better constrain 0* and pa¬ 
rameters, by overcoming both Poissonian errors and cosmic vari¬ 
ance. This is exemplified in Figure 11 . The standard deviation of 
the and 0* parameters at each volume size becomes smaller 
with increasing volume. Note however the values of and 0* 
are highly correlated (Figure Bl). 

As shown in this section, cosmic variance can fully explain the 
differences found in the literature regarding the Ha LF at z 0.2. 
By accounting for cosmic variance our LF results can be reconciled 
with those of Drake et al. (2013) and Ly et al. (2007). Our results 
indicate that at z ~ 0.2, volumes of at least 10^ Mpc^ are required 
to overcome cosmic variance. 


5 CLUSTERING OF Ha EMITTERS 

To study the clustering of our sample of 220 bright Ha emitters at 
z ^ 0.2, we start by generating a random catalogue with 1 million 
sources. The random catalogue sources follow the geometry of the 


actual observed fields and masked areas (due to saturated stars) and 
their number in each CCD of each pointing is normalised according 
to the depth attained (and hence the density of sources in that area). 

We follow the method described in detail in Sobral et al. 
(2010), which evaluates the two-point angular correlation function 
minimum variance estimator proposed by Landy & Szalay (1993): 


cu(0) = l + 



^ DD(e) iVR DR(e) 
RR(0) Vd RR(0) ’ 


(15) 


where 9 is the angle on the sky and Nr and No are the number of 
sources in the random and real catalogue of Ha sources. DD(0), 
RR(0) and DR(0) are the number pairs of sources located at dis¬ 
tances between 9 and 9 59 in the real data, random data and 

between real and random data, respectively. 

Errors on ft)(0) are then (Landy & Szalay 1993): 


Aco{9) 


l + cn(0) 

v/ddM' 


(16) 


We determine ft)(0) using 1000 different randomly selected 
sub-samples of sources selected from the randomly generated cat¬ 
alogue. We perform our analysis separately on emitters selected in 
each filter, but combine the data for the SA22, W2 and XMMLSS 
fields. We use the full luminosity range (Lh^ = erg s“^) 

of the Ha emitters, as well as split the sample in two roughly equal 
halves: a faint sample with luminosities in the range io4i O-4i.55 
erg s“^ and a bright one with luminosities io4i-55-42.40 
We bin the data using a range of angular scale bins (with different 
starting bin 0jnin^ bin width 59 and maximum bin 0max)- 

The results are presented in Figure 12 and Table 6. The two- 
point correlation function for the samples is well described by a 
single power law. The results for the two filters are considered sep¬ 
arately and when combined give fully consistent results within the 
error bars. 

Note we studied only the range 0.02 deg < 0 < 3.0 deg, where 
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Figure 14. The minimum DM halo mass (Mdm) as function of luminosity (Tna, left) and luminosity scaled by the characteristic luminosity at the respective 
redshift (LHa/T^^(z), right). The data from Sobral et al. (2010), split per luminosity bin, are shown for comparison. The ~ 0.2 points are renormalised using 
the derived in this paper. All luminosities are not corrected for intrinsic dust extinction. Note the relation between the Ha luminosity and host mass. When 
scaled for the typical luminosity, a clear relation between DM halo mass and luminosity is observed from z ~ 2.23 to z ~ 0.2. 


there was enough signal. At scales smaller than < 0.02 deg, a flat¬ 
tening of ft)(0) occurs, maybe caused by bright Ha emitters not 
being able to reside in a single halo. Additionally, since our survey 
is not very deep, we do not probe the regime where satellites are 
expected. Therefore, we cannot evaluate the departure of the two- 
point correlation function from a single power, which is caused by 
the transition from the large scale (two galaxies residing in sepa¬ 
rate dark matter halo) to the small scale clustering regime (galaxies 
sharing a single halo, e.g. Ouchi et al. 2005). 

Previous research indicates that bright Ha galaxies as well as 
Lyman break galaxies are more clustered than the faint ones. Sh- 
ioya et al. (2008) found that the two-point correlation function for 
faint Ha emitters (Lh^ < 10"^^-^^ erg s“^) at z ^ 0.24 follows the 
relationship: co{6) = (0.011 ± 0.002)while brighter 
emitters with lO'^^-^^ < Lh^ < 10"^^erg s“^ follow the relation¬ 
ship: m(0) = (0.019 ± 0.004)The amplitude of the 
two-point correlation function for our faint sample is 0.208 ±0.035, 
while for the bright sample it is slightly larger: 0.295 ± 0.026. The 
relation is also steeper for the bright sample than for the faint sam¬ 
ple. Our results therefore support and extend the claim that brighter 
(and hence more star-forming galaxies) are more clustered than 
faint ones to very high luminosities beyond 10^^-^ erg s“^ up to 

1042-4 g-1 _ 5 0). 

We use the inverse Limber transformation and the redshift 
distribution of the NB filters to translate the two-point correlation 
function into a 3D spatial correlation (Peebles 1980), assuming the 
latter is well described by e = (r/rp)^, where rp is the real-space 
correlation length of the Ha emitters. Following the method of So¬ 
bral et al. (2010), we assume that the two filters have a perfect top- 
hat shape. We compute ro for each realisation of ft)(0) in each filter, 
by fixing /3 = — 0.8. We finally combine the data for the two filters. 
The dependence of ro on redshift is shown in Figure 13. 

For the full sample, we obtain a correlation length ro = 3.3 
Mpc/h with a standard deviation 0.8 Mpc/h. We obtain ro = 3.5 ± 
1.1 Mpc/h for our fainter Ha sample and 5.0 ± 1.5 Mpc/h for the 
brighter one. Our measurements are larger than those of Sobral 
et al. (2010) at z ^ 0.24 (based on the sample from Shioya et al. 
(2008)), which find a value of 1.8 ± 0.2 Mpc/h for their sample with 
1039.4 ^ ^ 10"^^-^ erg s“^. As expected, fainter Ha galaxies 

have smaller correlation lengths than brighter ones (Norberg et al. 


2001; Shioya et al. 2008; Sobral et al. 2010). The correlation length 
also depends on redshift, but the evolution is driven by the typical 
luminosity: at high redshift, Ha emitters are on average brighter 
and have larger ro than lower redshift sources. 

Similar results are found by Hartley et al. (2010), who select 
galaxies using K band luminosity as proxy for stellar mass. The au¬ 
thors find that red galaxies, likely mostly ellipticals, are more clus¬ 
tered than the blue galaxies. Selecting star-forming galaxies based 
on colours, they find that ro drops with redshift. However, no de¬ 
pendence of ro on broad band luminosity was found. By contrast, 
Bielby et al. (2014) use a mass selected sample and find that higher 
mass galaxies tend to have larger clustering lengths. Additionally, 
they find that the clustering strength increases with stellar mass. 
Stellar mass correlates well with SFR (e.g. at z ~ 0.2 Stroe et al. 
2015), which can then be translated to an equivalent Ha luminos¬ 
ity though equation 11. The results from Bielby et al. (2014) may 
indicate that more star forming, more luminous galaxies have larger 
ro which is consistent with our findings. Note however that Sobral 
et al. (2010) controlled for both Ha luminosity and mass (K band 
luminosity) and found both are important for the evolution of ro: ro 
increases with both higher Lh^ and K band luminosity. 

The clustering of the Ha emitters depends on the clustering 
of their host dark matter (DM) haloes. The bias parameter b{z) de¬ 
scribes how the matter distribution traces the DM distribution, as 
function of redshift. In the bias model of Matarrese et al. (1997), 
the physical parameters of galaxies are determined by their host 
dark matter halo mass. In such a model, b{z) depends on the min¬ 
imum mass of the DM halo. Figure 13 also contain ro predic¬ 
tions for dark matter (DM) haloes with fixed minimum mass of 
Mjnin = as calculated by Geach et al. (2008) assuming 

a ACDM cosmology and an evolving bias model from Matarrese et 
al. (1997) and Moscardini et al. (1998). Note however, the ro pre¬ 
diction is highly dependent on the model, see for example Hartley 
et al. (2010). We thus note that while the trends are valid, the nor¬ 
malisation of the Mjnin could be higher than that used here, leading 
to lower masses than derived here. 

The emitters from Shioya et al. (2008), probing fainter Ha 
regimes with Lh^ < 1041.5 

erg s ^, reside in DM haloes of 10^^Mq 
mass. These are most likely dwarf galaxies. By contrast, our faint 
sample is hosted by DM haloes of about lO^^-^M© mass, about the 
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mass of the Milky Way. The bright Ha emitters are hosted by 
1013“13-5 ^q dm haloes, which are most probably already galaxy 
groups. 

Figure 14 shows how the DM halo minimum mass varies as 
function of Ha luminosity and the luminosity scaled by the char¬ 
acteristic luminosity at that redshift {Lua /By comparing 
our results, with the results from Sobral et al. (2010) (based on data 
from Shioya et al. (2008)), we find a linear correlation between the 
host minimum DM halo mass and luminosity (in log-log space, see 
Figure 14). This indicates that more luminous, more star-forming 
galaxies reside is more massive dark matter haloes. 

Accounting for the evolution of the characteristic luminosity 
with redshift, we find that more luminous emitters reside in more 
massive DM haloes, irrespective of redshift. Such a comparison 
between z < 0.4 and z > 4 samples has been previously difficult 
because of the different ranges probed in the differ¬ 

ent redshift ranges. With our measurements, we probe beyond 
at z 0.2 for the first time to be fully comparable with samples 
up to z ^ 2.23. Our measurements therefore confirm the results 
from Sobral et al. (2010) and Geach et al. (2012), who find that 

galaxies reside in ^ 10^3 Milky Way size DM haloes, at 
all redshifts. The results indicate the the position of a star forming 
galaxies within the Ha luminosity function is dictated by the host 
DM halo mass, at all cosmic times since 2.3. 


6 CONCLUSIONS 

In order to constrain the evolution of the star-forming galaxies 
across cosmic time, large samples of sources are necessary. Such 
samples are available at high redshifts (z > 0.8) through NB se¬ 
lected Ha emitter samples which probe large volumes (> 10^ 
Mpc3) and overcome cosmic variance. However, at low redshifts 
(z < 0.8), large areas (> 15 deg^) need to be surveyed in order 
to match the volumes at high redshift. By carrying out the largest 
survey of Ha emitters at z ^ 0.2, we produce a luminosity func¬ 
tion describing typical galaxies within representative volumes of 
the Universe. With our large sample of bright emitters we study 
their distribution and clustering and place it in the context of the 
evolution of the SFRD throughout cosmic history. Our main results 
are: 

• The Ha luminosity function at z ^ 0.2 is well described by 

a Schechter function with log(0*) = —2.85 ±0.03 (Mpc“3) and 
log(L^^) = 41.71 ±0.02 (erg s“^). We find that previous studies, 
probing far smaller volumes, underestimate the characteristic lumi¬ 
nosity reconciled with our results if cosmic variance 

uncertainties are taken into account. For volumes typically probed 
in previous Ha works at z ^ 0.2 of < 5 x 10^ Mpc3, cosmic vari¬ 
ance can account to more than 50 per cent variance in the LF pa¬ 
rameters. 

• By assuming a 15 per cent AGN fraction, we derive a star for¬ 
mation rate density of Psfrd = 0.0094±0.0008 M© yr“^ Mpc“^ 

• We find significant cosmic variance in the distribution of the 
Ha emitters, but on average 1—4 bright (Lna > 10"^^erg s“^) 
Ha emitters are found per square degree. 

• We study the clustering of the Ha emitters. The two-point 

correlation function is well fit by a single power law ft)(0) = 
( 0.159 ±O.O12)0*^“^-^3±o.O 5)^ ^ spatial clustering length tq = 

5.0 ± 1.1 Mpc/h for the bright sample 

ro = 3.5 ± 1.1 Mpc/h for the faint sample (iQ^l-55-42.40 ^^.g 
Our results confirm that luminous, strongly star-forming galaxies 
are more clustered than those weakly star-forming. 


• We find that, at z ~ 0.2, the higher the SFR, the more massive 
the DM halo host is. When accounting for the redshift dependence 
of the characteristic Ha luminosity, there is no redshift dependence 
of the host mass, but a strong dependence on L\ia/. 
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APPENDIX A: SURVEY COMPLETENESS 

The method for studying the completeness is detailed in §4.1. The 
dependence of the completeness on line flux is shown in Figure Al. 


APPENDIX B: SURVEY COMPLETENESS 

The results of the resampling of the LF at z ^ 0.2 with different 
binnings is presented for a range of data selections. The faint end 
slope is flxed at —1.35 and —1.7 and 0 and L are fit using data 
selected from the two NB Alters independently and combined. The 
results are shown in Figures Bl, B2, B3 and B4. 
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Figure Al. Survey completeness as a function of Ha flux, plotted separately for each fleld and NB Alter used to select Ha candidates. Each curve is associated 
with the the completeness study for a different CCD chip within each pointing. The darker the colour the more completeness curves fall within that region. 
Note the XMMLSS fleld is signiflcantly more complete than the W2 fleld. 
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Figure Bl. As for Figure 9, but with different values of a and when using the data for the two Alters separately or together. 
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Figure B2. As Figure 11, but for data samples from the two NB filters independently. Note that similar results are found for the two filters, even when 
considered separately. 
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Figure B3. As Figure 11, but for data samples from the two NB filters independently. Note that similar results are found for the two filters, even when 
considered separately. 



Figure B4. As Figure 11, but for different a values. 


MNRAS 000, 000-000 (0000) 


























